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Abstract. Let S(X,B) be a symmetric ("palindromic") word in two letters 
X and B. A theorem due to Hillar and Johnson states that for each pair of 
positive definite matrices B and P, there is a positive definite solution X to 
the word equation S(X,B) = P. They also conjectured that these solutions 
are finite and unique. In this paper, we resolve a modified version of this 
conjecture by showing that the Brouwer degree of such an equation is equal 
to 1 (in the case of real matrices). It follows that, generically, the number of 
solutions is odd (and thus finite) in the real case. Our approach allows us to 
address the more subtle question of uniqueness by exhibiting equations with 
multiple real solutions, as well as providing a second proof of the result of 
Hillar and Johnson in the real case. 



1. Introduction 

In this paper, we consider a natural matrix generalization to the elementary 
scalar equation 

bx s = p, 

in which b > 0, p > 0, s € Z + and x is a nonnegative real indeterminate. One 
difficulty with an extension is dealing with matrix noncommutativity, while another 
is determining what should be meant by the words "real" and "nonnegative." For- 
tunately for us, the latter concerns have already long been addressed: the natural 
matrix interpretation of the reals are the Hermitian matrices, while nonnegative 
(resp. positive) numbers correspond to those complex Hermitian matrices with all 
nonnegative (resp. positive) eigenvalues, the so-called positive semidefinite (resp. 
positive definite) matrices. The issue of noncommutativity, however, is of a more 
subtle nature, and we first introduce some notation before addressing it. 

Fix a positive integer k, and let W — W(X, B\, ... , Bk) be a word in the letters 
X and B\, . . . , B^. The reversal of W is the word written in reverse order, and it 
is denoted by W*. A word is symmetric if it is identical to its reversal (in other 
contexts, the name "palindromic" is also used). As we shall soon see in Sections [5] 
and[3l formulating our generalization requires restriction to a special class of words. 
For the purposes of this work, an interlaced word W = W(X, B\, . , . ,-Bfe) in the 
interlacing letter A is a juxtaposition of powers of letters that alternate in powers 
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of X . More precisely, an interlaced word is an expression of the form, 

m 

(1) W = B^]JX^B^, 

3=1 

in which the exponents Pj > 0, qj > are nonnegative integers, m > 1, and 
{il, . . . , i m +i} C {1, . . . , A:}. (Here, of course, we consider the zeroth power of a 
letter to be the empty word, the identity element of the monoid). For example, 
the word BiXBlX 2 B%X 5 is interlaced, whereas the word XBiB 2 XB 2 BiX is not. 
The integer s = p% + ■ ■ ■ + p m is called the degree of the interlaced word W . 

The interlacing letter X is distinguished, and is to be viewed as an indeterminate 
n x n positive semidefinite matrix, while the letters B\ , . . . , Bk correspond to fixed 
n x n positive definite matrices. For convenience, the letters X and Bi will also 
represent the substituted matrices (the context will make the distinction clear). 
When k = 1, the set of interlaced words is simply the set of all words in two letters 
containing at least one X. For notational simplicity, when k is understood, we 
write W(X, Bi) in place of W{X, B u ..., B k ). 

Returning to our motivating example, notice that there is a unique nonnegative 
solution to the equation bx s = p for every pair of positive b and nonnegative p; we 
would like to generalize this observation. Our introductory remarks prepare us to 
make the following definition. 

Definition 1.1. A symmetric word equation is an equation, S(X,Bi) — P, in 
which S(X, Bi) is an interlaced symmetric word. If the Bi are positive definite and 
P is positive semidefinite, then any positive semidefinite matrix X for which the 
equation holds is called a solution to the symmetric word equation. 

A symmetric word equation will be called solvable if there exists a solution for 
every positive definite nxn matrices Bi and nxn positive semidefinite P. Moreover, 
if each such Bi and P gives rise to a unique solution, the equation will be called 
uniquely solvable. We are motivated by the following striking result. 

Theorem 1.2 (Hillar and Johnson). Every symmetric word equation is solvable. 
Moreover, if the parameters P and Bi are real, then there is a real solution. 

Theorem 1 1 . 2 1 first appeared in [141 Theorem 7.1] with an argument that involved 
fixed-point methods. The authors of [14] also conjectured that symmetric word 
equations have unique solutions. 

Conjecture 1.3. Every symmetric word equation is uniquely solvable. 

There is much evidence to support Conjecture 11.31 For instance, there are large 
classes of equations that are uniquely solvable (see Section where we encounter 
the class of totally symmetric word equations), and recently, Lawson and Lim [20] 
have verified the conjecture in the case that the degree of S(X,Bi) is not greater 
than five. Their approach utilizes the Riemannian metric on the set of positive 
definite matrices and Banach's fixed-point theorem. In addition, every numerical 
investigation that we are aware of has failed to produce multiple solutions (see also 
Remark 111.31 below) . 

Symmetric word equations are not only natural from a theoretical perspective; 
they also arise in many other contexts. For instance, they play a role in recent 
attacks [12l[T3l[T5] on the Bessis-Moussa-Villani trace conjecture [3], a long-standing 
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problem in statistical physics. A brief overview of this application is given in Section 

El 

One well-known matrix equation is the Riccati equation: 

(2) XBX T = P. 

In general, the Riccati equation (EJ) has many solutions; however, when B and P 
are positive definite and we seek positive definite solutions X, it is equivalent to 
the symmetric word equation XBX = P. The unique positive definite solution X 
to this equation is given by 

(3) X = p 1 / 2 (p- 1 / 2 B^ 1 P~ 1/2 ) 1/2 P 1/2 . 

This fact has been observed by many authors independently (see for example 
[3 El US Q21 EH [30]). The right-hand side of Q is called the geometric mean 
of P and B , and is written as P#£? _1 . The Riccati equation and the corre- 
sponding geometric mean are ubiquitous. They appear, for example, in work on 
matrix inequalities [U E] , the theory of optimal transportation E2 ESI El HH [30] , 
convex optimization and control theory I29j , and the geometry of non-compact 
symmetric spaces ETJ E3] ■ 

In this paper, we resolve Conjecture 11.31 negatively in the case n > 3. The 
conjecture remains open for 2x2 matrices (although see Theorem 1 1 1 .41 for a proof 
of uniqueness in a special case). 

Theorem 1.4. There are symmetric word equations of degree 6 which have multiple 
real 3x3 positive definite solutions. 

Theorem 11.41 shows that the result of Lawson and Lim [20] is optimal. Although 
uniqueness fails in general, our approach allows us to verify that these equations 
are still very well-behaved in the following sense. 

Theorem 1.5. Fix an interlaced symmetric word S and real positive definite ma- 
trices B\, . . . , Bk and positive semidefinite P. Then, real solutions X to the word 
equation f(X) — S(X,Bi) — P are bounded. In addition, if P is invertible, then 
there is a bounded open subset U of real positive definite matrices ( containing all 
real solutions) such that 

deg(f,U.P) = l. 

Here, deg(/, U, P) is the Brouwer degree of / at P with respect to U; in a vague 
sense, it gives a topological measure of the number of solutions inside U to the 
equation f(X) = P. It is in this sense that Theorem 11.51 verifies the intuition of 
Conjecture 11.31 Theorem 11.51 is powerful enough to show that while uniqueness 
fails in general, generically, the number of solutions to a symmetric word equation 
is finite. 

Corollary 1.6. Fix positive definite matrices B\, . . . , B^. Then, for almost every 
real positive definite matrix P , the symmetric word equation 

S(X, Bi) = P 

has an odd (and thus finite) number of real positive definite solutions X. 

Proof. By Theorems ll.5l and l7.31 at any regular value P of the map X i— > S (X, Bi), 
the equation S(X, Bi) = P has an odd number of solutions X. By Sard's theorem, 
the set of regular values is a set of full measure, completing the proof. □ 
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The hypothesis of Theorem 11.51 requiring the symmetric word to be interlaced 
cannot be dropped: there exist non-interlaced symmetric word equations with an 
unbounded set of solutions (see Example 12. 5|) . Theorem 11.51 also implies a special 
case of Theorem 11.21 giving a second proof of existence in the real case. 

Corollary 1.7. Every symmetric word equation in real positive definite letters has 
a real positive semidefinite solution. 

Proof. The result follows from Theorem 1 1.5[ Theorem 16. 2\ and Lemma mi □ 

The proof of Theorem 1 1.51 is the content of Sections [8l [9l and [TOl The arguments 
in the proof often employ the reductions found in Section [6l Some consequences 
of Theorem 11.51 are explored in Section [Til including a proof of Theorem 11.41 In 
Sections [2] and [3] we explain our restriction to interlaced symmetric words, and 
Sections 0] and [S] are devoted to applications and a special class of uniquely solvable 
words, respectively. In Section [7] we review the theory of Brouwer degree. 

The authors would like to thank Bill Helton for several interesting conversations 
about this problem and Robert McCann for his helpful advice and references. 

2. Examples 

The simplest instance of a symmetric word equation arises in the following ex- 
ample T7J p. 405]; it is the most straightforward generalization of the scalar case. 

Example 2.1. Let P be any positive semidefinite matrix and let S(X) be the word 
X m , for a positive integer m. Then, there is a unique positive semidefinite solution 
to the equation S(X) = P. In fact, writing P = UDU* for a unitary matrix U and 
a nonnegative diagonal matrix D, we have 

Our next example is the Riccati equation, which we encountered in the intro- 
duction. 

Example 2.2. Given positive definite B and positive semidefinite P, the equation 
XBX = P has a unique positive semidefinite solution X , given by 

X = B-^P = B-^iB^PB^^B- 1 / 2 . 

Uniqueness can be deduced from the proof of Provosition [572[ in which a large class 
of word equations are shown to be uniquely solvable. When P is invertible, this 
solution can also be expressed as 

X = P^B- 1 = pl/2(p-l/2 B -lp-l/2 ) l/2pl/2. 

i.e., the geometric mean satisfies Aff^B — Bff^A for all positive definite matrices A 
and B. At first glance, this is surprising, since the expression 

AjfB = A 1 ' 2 (A-'^BA- 1 ' 2 ) 1 ' 2 A 1 ' 2 

does not appear to be symmetric in A and B. □ 

As promised, we now explain why we restrict our attention to interlaced sym- 
metric words. A first obstacle in generalizing the scalar case is that most words 
do not evaluate to positive semidefinite matrices upon substitution. One simple 
example is the word XB, which does not even have to be Hermitian when X and 
B are positive definite. Similarly, the unique matrix solution X of the equation 
XB = P is not, in general, positive semidefinite. It turns out that the right class 
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of words to consider are the symmetric ones, and this is evidenced by the following 
discussion. 

Recall that two n x n matrices X and Y are said to be congruent if there is an 
invertible n x n matrix Z such that Y = Z* XZ (here, C* denotes the conjugate 
transpose of a complex matrix C); and that congruence on Hermitian matrices 
preserves inertia (the ordered triple consisting of the number of positive, negative, 
and zero eigenvalues) and, thus, positive definiteness |17| p. 223]. A symmetric 
word evaluated at positive definite matrices is inductively congruent to the "center," 
positive definite matrix. We conclude that 



Lemma 2.3. 

definite. 



A symmetric word evaluated at positive definite matrices is positive 



A more careful examination (or a simple continuity argument) also proves the 
following. 

Lemma 2.4. A symmetric word evaluated at positive semidefinite matrices is pos- 
itive semidefinite. 

Conversely, it may be shown that symmetric words are the only words that are 
positive definite for all positive definite substitutions (see Section [3] for a proof). In 
light of these facts, restricting our consideration to symmetric words seems appro- 
priate. 

Next, we discuss the difficulties that arise when considering non-interlaced sym- 
metric words. As the following examples demonstrate, both finitcncss and existence 
may fail even when k = n = 2 and s = 3. 

Example 2.5. Let S(X,B 1 ,B 2 ) = XB 1 B 2 XB 2 B 1 X and set 



B 1 = 



3 —I " 


, B 2 = 


ts3 


I " 


— I I 


1 


1 



p = 



' 


" 


and X = 


x/5 —x 









X 




—x 5x 



Then, as is easily verified, the equation S(X, B\, B 2 ) = P has symmetric solutions 



X = 



in which x is an arbitrary real number. In particular, there are infinitely many 
positive semidefinite solutions (in two distinct unbounded solution classes). Notice 
also that the kernel of a solution X and that of P can be different. For interlaced 
words, this situation cannot occur (see Lemma \6.1\) . □ 

Example 2.6. Let S and B\,B 2 be as in the previous example, but instead set 





P 



1 



X = 



Then, there are no positive semidefinite solutions to S{X, B\, B 2 ) = P. To verify 
this, suppose that 

' e / 
g h 

is a complex solution to S(X, B\, B 2 ) = P. Computing the ideal generated by the 
4 consequent polynomial equations ( using Maple or Macaulay 2 to find the reduced 
Grobner basis), we find that it is the entire ring C[e, /, g, h] . In particular, there are 
no matrix solutions over C to the given equation, much less positive semidefinite 
ones. □ 
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3. Relations Between Positive Definite Words 



In this section, we explain our restriction to symmetric words. Specifically, we 
prove that a word VF(^4, B) in two letters A and B is positive definite for all positive 
definite substitutions if and only if the word is symmetric. 

We begin by illustrating some of the subtlety of the problem. Let B and P be 
positive definite matrices. In Example 12.21 we saw that 



pl/2 (p-l/2 B -lp-l/2\ 1/2 pl/2 = B 



-1/2 f B y 



x 1/2 

/2ppl/2 £-1/2 



even though both expressions are quite different. In fact, both sides of the above 
equality are the unique solution X to the symmetric word equation, 

S(X, B) = XBX = P. 

Fortunately, such behavior does not occur with words, as the following discussion 
illustrates. 

Let W be the set of words in two letters A and B, and fix a, b to be two n x n 
complex matrices. Consider the evaluation homomorphism Eval a .b ■ W — > IM„(C) 
which sends a word W(A,B) to the matrix W(a,b) produced by substituting the 
matrices a and b for the letters A and B, respectively. By convention, the empty 
word is sent to the identity matrix by this map. We describe a pair of positive 
definite a and b for which this function is injective. 

Lemma 3.1. The map Eval a ^b is injective when 





' 3 


1 " 




' 1 


1 " 


a = 


1 


1 


, b = 


1 


3 



Proof. Let a, b be the matrices in the statement of the lemma, and let W\ and W% 
be two words for which Wi(a,b) = W<2,{a, b); we must show that W\ and are 
the same word. If either W\ or Wi is the empty word, then the claim is clear (take 
a determinant). Furthermore, since a and b are invertible, we may suppose that 
W\ = AU and W% — BV for some words U and V. 

Let x and y be indeterminates. Given a word W , we set 



W x x + W v y 
W x x + W y y 



W(a,b) 



for natural numbers W x ,W V ,W x ,W y . Notice that by our choice of a and b, we 
cannot have both W x and W v equal to zero. A direct computation shows that 
(AU) X -(AU) X = 2U X and that (BV) X -(BV) X = -2V X . By assumption, these two 
numbers are equal so that U x + V x = 0. Since these two quantities are nonnegative 
integers, it follows that U x = V x = 0. Similarly, the equality (AU) y — {AU) y — 
(BV) V ~(BV) y implies that U v = V y = 0. This contradiction finishes the proof. □ 

Corollary 3.2. The following are equivalent for a word W . 

(1) W is positive definite for all substitutions of positive definite A and B 

(2) W is Hermitian for all substitutions of positive definite A and B 

(3) W is Hermitian for all 2 x 2 substitutions of positive definite A and B 

(4) W is symmetric ("palindromic") 

In particular, if a word is Hermitian for all 2x2 substitutions of positive definite 
A and B, then the word is necessarily positive definite for all such substitutions. 
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Proof. (1) (2) => (3) is clear. If W(A,B) is always Hermitian for 2 x 2 positive 
definite A and B, then W(A,B)* = W(A,B) for all such A and S. But then 
Lemma \3. II savs that W* and 14 7 must be identical as words. It follows that W is 
symmetric. This proves (3) =>■ (4). Finally, if W is symmetric, Lemma 12.31 savs 
that W will always be positive definite for any positive definite A and B. This 
completes the proof. □ 

4. An Application 

We first encountered symmetric word equations when studying a trace conjecture 
[T5] involving words in two letters A and B (see also fTS]). 

Conjecture 4.1. j4 word m iwo letters A and B has positive trace for every pair 
of real positive definite A and B if and only if the word is symmetric or a product 
(juxtaposition) of 2 symmetric words. 

For each solvable symmetric word equation, one can identify an infinite class 
of words that admit real positive definite matrices A and B giving those words a 
negative trace. The following is a brief description of this application. Consider 
the word W — BABAAB, which is not symmetric nor a product of two symmetric 
words. In light of Conjecture 14.11 we would like to verify that there exist real 
positive definite matrices A and B giving W a negative trace. This is surprisingly 
difficult, as the methods in 13J show. Resulting A and B that exhibit a negative 
trace are, for example, 



1 20 210 

20 402 4240 
210 4240 44903 



and B\ 



36501 -3820 190 
-3820 401 -20 
190 -20 1 



We will run into these matrices again in our proof of Theorem 11.41 

Consider now the following extension. Let T be the word given by T = S1S2, in 
which Si and S2 are symmetric words in the letters A and B. If the simultaneous 
word equations 

S 1 (A,B) = B 1 , 
S 2 (A,B)=A 1 

may be solved for positive definite A and B given positive definite A\ and B\, then 
the word TTT* can have negative trace. Specializing to the case that S2 is the 
word A, we have the following. 

Corollary 4.2. Let S = S(A, B) be any symmetric word with at least one B . Then 
the word SASAAS admits real positive definite matrices A and B giving it negative 
trace. 

Proof. The matrix B1A1B1A1A1B1 has negative trace. Using Corollary 11.71 the 
equation S(A\,X) — B\ has a real positive definite solution X = B%. The two 
matrices B = B2 and A = A\ are then the desired witnesses. □ 

Conjecture 14.11 while interesting in its own right, arises from an old problem 
in statistical physics, the Bessis-Moussa-Villani conjecture. In [3], while studying 
partition functions of quantum mechanical systems, a conjecture was made regard- 
ing a positivity property of traces of matrices. If this property holds, explicit error 
bounds in a sequence of Pade approximants follow. Recently, in [22j , and as previ- 
ously communicated to the authors of [13], the conjecture of [3] was reformulated 
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by Licb and Sciringer as a question about the traces of certain sums of words in 
two positive definite matrices. 

Conjecture 4.3 (Bessis-Moussa-Villani). The polynomial p(t) = Tr [(A + tB) m ] 
has all positive coefficients whenever A and B are n x n positive definite matrices. 

The coefficient of t k in p(t) is the trace of H m ^(A, B), the sum of all words of 
length m in A and £?, in which k B's appear. Since its introduction in [3], many 
partial results and substantial computational experimentation have been given [6l 
El HH [HI [28] , all in favor of the conjecture's validity. However, despite much work, 
very little is known about the problem, and it has remained unresolved except in 
very special cases. Until recently, even the case m = 6 and n — 3 was unknown. 
In this case, all coefficients, except Tt[Hq^(A, B)] were known to be positive [IB] . 
The remaining coefficient Ti[Hq^(A, B)} can be shown to be positive, but the proof 
requires notably different methods [15j . The difficulty is that some summands of 
Hs t s(A, B) can have negative trace, precisely the types of words such as BABAAB 
considered above. The matrices A\ and B\ above were also used by Hansen [10] in 
his approach to this trace conjecture. 

A recent advance [12] has been the derivation of a pair of equations satisfied by 
A and B with Euclidean norm 1 that minimize a coefficient Tr[H m ^(A, B)]: 



It is possible that some of the techniques developed here can be applied to these 
more general types of word equations. 



In this section, we describe a class of words that are uniquely solvable with 
solutions that can be constructed explicitly. These words generalize those found in 
Examples O and l2~2l 

Definition 5.1. A symmetric word is called totally symmetric if it can be expressed 
as a composition of maps of the form 

(1) n m , Bi (W) = (WBi) m W, m a positive integer 

(2) ip m (W) = W m , m a positive integer 

(3) C Bi (W) = BiWBi 

applied to the letter X. 

For example, the word W — B\X 2 B2X 2 B2X 2 B\ may be expressed as the com- 
position, Cb-l o H2,b 2 l P2(X). The utility of this definition becomes clear from the 
following proposition. 

Proposition 5.2. For every totally symmetric word S(X, Bi) and every positive 
definite Bi and positive semidefinite P, the equation S(X, Bi) — P has a unique 
positive semidefinite solution X . 

Proof. We induct on the number of compositions involved in the word S; the base 
case S = X being trivial. If S = (p m (W) for some word W, then W = P 1 /" 1 is 
a smaller totally symmetric word equation and any solution X to S (X, Bi) = P 
satisfies it. A similar statement holds when S = (W) (using Lemma liO)) , leaving 
us to deal with / K m .B i ■ 



( 



AH m -i, k (A,B) 
BH m ^i t k-x(A, B) 



A 2 Tr[AH m _ hk (A,B)} 
B 2 Tr[BH m ^ lik _i(A, B)]. 



5. A Class of Uniquely Solvable Equations 
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Without loss of generality, we prove the result for the equation (XB) m X = P. 
Assume that B and P are given and that X is a solution to (XB) m X = P. Set 
Y = B 1 I 2 XB 1 / 2 , so that X = B~ l / 2 Y 'B^l 2 . Then, 

p = ^- 1 / 2 YB 1/2 ) m B^ 1/2 YB^ 1/2 = B~ 1/2 Y m+1 B~ 1/2 . 

Therefore, Y m+1 = B 1 ' 2 PB 1 / 2 , from which it follows that Y is uniquely deter- 
mined as (i3 1 / 2 PS 1 / 2 ) 1 /(™ l + 1 ). Hence, X must be the positive semidefinite ma- 
trix j B-i/2( j Bi/2p B i/2)i/(m+i) B -i/2 p inally; substituting this X into the original 
equation does verify that it is a solution. This completes the proof. □ 

The shortest symmetric word equation without a known (closed-form) solution, 
as above, is XBX 3 BX = P (although it is uniquely solvable [10]). An exploration 
of which equations give rise to such explicit solutions is the focus of future work. 

6. Reductions 

The purpose of this section is to make some reductions that simplify the problem. 
Given the nature of Theorem 11.21 and Conjecture [L3j we begin by noticing that we 
may assume our interlaced symmetric words are of the following form: 

(4) S = X Pl B 1 X P2 B 2 ---B 2 X P2 B 1 X Pl , 

in which the exponents pj are positive. This simplification is accomplished by 
observing first, that powers of positive definite matrices are positive definite; and 
second, that congruences of positive semidefinite P are positive semidefinite. 

We next establish that it suffices to verify our claims when P is invertible. We 
begin with a useful lemma. 

Lemma 6.1. Let pi, . . . ,p/. > and let B\, . . . , B^-i be positive definite matrices. 
Then, for any positive semidefinite matrix X , we have 

ker X = ker X Pk B k ^ ■ ■ ■ B 2 X P2 B 1 X Pl . 

Proof. Set X = UDU* for a unitary matrix U and D = diag(Ai, . . . , A„), in 
which Ai > ... > A n > 0. Let Y = X Pk B k ^ ■ ■ ■ B 2 X P2 BxX Pl , and notice that 
ker U*XU = ker U*YU if and only if ker A = ker Y. Thus, it suffices to argue that 

kerL> = ker L^S^i • • • B 2 D P2 BxD Pl , 

whenever the Bi are positive definite matrices. 

Let m be the largest integer such that A m ^ 0, and for each i, let Bi denote the 
m x m leading principal submatrix of Bi, which will be positive definite (see, for 
instance, [T71 p. 472]). Additionally, set D = diag(Ai, . . . , A m ). A straightforward 
block matrix multiplication then gives us that 



(5) £> Pfe -B fe _i • • • B 2 D P2 B X D P1 = 



D p "B k - 1 ---B 2 D P2 B 1 D Pl 




Since the leading principal m x m matrix in this direct sum is invertible, the claim 
follows. □ 

Using this lemma, we can prove the following reduction. 

Theorem 6.2. // a symmetric word equation has a solution for every positive def- 
inite Bi and P , then the symmetric word equation has a solution for every positive 
definite Bi and positive semidefinite P. 
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Proof. Performing a uniform unitary similarity, we may prove the theorem with the 
supposition that P is of the form, 

P 
J ' 

for a positive diagonal matrix P of rank m. Lemma 16.11 implies that any positive 
semidefinite solution X to the symmetric word equation S(X, B^) = P has the 
same block form as P. As in the lemma, let Bi denote the m x m leading principal 
(positive definite) submatrix of each Bi . 

From these observations, it follows that positive semidefmite solutions X to the 
equation S(X,Bi) — P correspond in a one-to-one manner with positive definite 
solutions X to the equation S(X , Bi) = P. This completes the proof. □ 

The proof above also shows that the question of uniqueness found in Conjecture 
11.31 may be simplified. 

Theorem 6.3. If a symmetric word equation has a unique solution for all positive 
definite matrices Bi and P , then the symmetric word equation has a unique solution 
for all positive definite Bi and each positive semidefinite P. 

We close this section with an interesting interpretation of unique solvability. 

Proposition 6.4. Fix positive definite matrices Bi in the unit ball and an interlaced 
symmetric word S(X, Bi) whose equations are uniquely solvable. Then, the mapping 
X i ► S(X,Bi) from the set of positive semidefinite matrices in the (closed) unit 
ball to its image is a homeomorphism. 

Proof. The assumptions imply that our map is bijective. Since the set of positive 
semidefinite matrices in the unit ball is compact, it follows that its inverse is also 
continuous. □ 



7. Brouwer Mapping Degree 



In this section, we give a brief overview of degree theory and some of its main 
implications. The bulk of this discussion is material taken from [24l [32] . First 
we introduce some notation. Let U be a bounded open subset of R m . We denote 
the set of r-times differentiable functions from U (resp. U) to R m by C r (U,R m ) 
(resp. C r (U, R m )) (when r = 0, C r (U, R m ) is the set of continuous functions). The 
identity function 1 satisfies l(x) = x. If / 6 C {U, R m ), then the Jacobi matrix of 
/ at a point x 6 U is 



dxi ^ ^ 



l<z,j<m 

and the Jacobi determinant (or simply Jacobian) of / at x is 

det Jf(x). 

The set of regular values of / is 

HV(/) - {ye R m : Vx e /-%), J/(x) + 0} 
and for y 6 R m , we set 

Dy(U, R m ) = {fe C r (U, R m ) : y g f(dU)} . 
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A function deg : D y (U, R m ) — > R which assigns to each y G R m and / G 
Dy(U, R m ) a real number deg(/, U, y) will be called a degree if it satisfies the fol- 
lowing conditions: 

(1) deg(/, U, y) = deg(/ — y, U, 0) (translation invariance). 

(2) deg(l, J7, y) = 1 if y G U (normalization). 

(3) If Ui and ^2 are open, disjoint subsets of U such that y ^ f(U\ (U\ UU2)), 
then deg(/, C/, y) = deg(/, Ui, y) + deg(/, C/ 2 , y) (additivity) . 

(4) If ff(t) = £/ + (1 - G D°(i7, R m ) for all t G [0, 1], then deg(/, f7,y) = 
deg(g, [/, y) (homotopy invariance). 

Motivationally, one should think of a degree map as somehow "counting" the 
number of solutions to /(x) = y. Condition (1) reflects that the solutions to 
/(x) = y are the same as those of /(x) — y = 0, and since any multiple of a 
degree will satisfy (1) and (3), condition (2) is a normalization. Additionally, (3) 
is natural since it requires deg to be additive with respect to components. The 
following lemma gives a method to show the existence of solutions to /(x) = y by 
calculating a degree. 

Lemma 7.1. Suppose that f G D y (U, R m ). If a degree satisfies deg(/, U, y) 7^ 0, 
thenyef(U). 

Proof. Using property (3) above with U± — U and U2 — 0, we must have that 
deg(/,0,y) = 0. Again using (3) with U 1 = U 2 = 0, it follows that if y £ f(U) then 
deg(/, U, y) = 0. The contrapositive is now what we want. □ 

Of course, we need a theorem guaranteeing that a degree even exists. 

Theorem 7.2. There is a unique degree deg. Moreover, dcg(-, U, y) : D y (U, R m ) — > 
Z. 

When functions are differentiable, the degree can be calculated explicitly in terms 
of Jacobians at solutions to the equation /(x) = y. 

Theorem 7.3. Suppose that f G D y (U, R m ) and y G RV. Then the degree off at 
y with respect to U is given by 

deg(/,I7,y)= sgn det J/(x), 

where this sum is finite and we adopt the convention that X) x g0 = 0- 

The final property of Brouwer degree that we will need is a stronger form of 
homotopy invariance than that provided by Property (4). We say that a function 
H : U x [0, 1] -> R m is a C° homotopy between f,g€ C r (U, R m ) if H is continuous 
on U x [0, 1] and if H(x, 0) = f(x) and H(x, 1) = g(x) for all x G U. 

Theorem 7.4. Suppose H is a C° homotopy between f,g G D y (U,R m ). Set 
h t (x) = H(x 1 t) and suppose that for each t G [0,1], h t G D y (U,R m ). Then 
deg(/,E/,y) = deg(g,U,y). 

8. Estimates of Solutions 

This section is devoted to estimating the norms of positive definite solutions of 
symmetric word equations. In particular, we show that the set of positive definite 
solutions to a fixed symmetric word equation S(X, Bi) = P is bounded. Our 
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estimate is the first step in a proof of Theorem 11.51 In what follows, we will be 
using the spectral norm [T7l p. 295] on the set of n x n matrices, so that for positive 
semidefinite A, the norm of A is just the largest eigenvalue of A. 

Lemma 8.1. Fix an interlaced symmetric word S(X,Bi) and a number a > 1. 
Then there exists a constant C = Cs.a depending only on S and a such that for 
all positive definite matrices Bi with \\Bi\\ < 1 and H-B" 1 !! < a and all positive 
semidefinite matrices P with ||P|| < 1, we have the estimate 

(6) U\\<C 

for any solution X of the word equation S(X, Bi) = P. 



Proof. We proceed by way of contradiction. If the statement is false, then for 
each positive integer j there exist positive semidefinite matrices Xj , Pj and positive 
definite matrices Bi^ such that S(Xj , B^j) — Pj, where ||-Bjj[| < 1, \\B^~j \\ < a, 
\\Pj\\ < 1, and \\Xj\\ > j. By taking a subsequence, if necessary, we may assume 
that there are positive semidefinite matrices Bi,P and X such that Bij — > Bi, 
Pj — > P, and HXj II^X,- — » X as j ' ^ oo. It is clear that 

(7) ||*|| = 1. 

Since H-Br^. || is bounded uniformly in j, each Bi is positive definite. Let s be the 
degree of 5*. Since \\Xj\\ > j for all j, if we let j — > oo in the equation 

S(\\X j \\- 1 X j ,B iJ )=\\X j \\- s P j , 

it follows that 

S(X, Bi) = 0. 

Finally an application of Lemma lG . 1 1 gives X = 0, which contradicts ([7]) and finishes 
the proof. □ 



Lemma [8.11 allows us to estimate \\X\\ in terms of the norms of the Bi and the 
norm of the word S(X, Bi). 

Proposition 8.2. Fix an interlaced symmetric word S(X, B\, . . . , Bk) of the form 
with degree s and a number a > 1. There exists a constant C = Cs.a depending 
only on S and a such that for all positive definite matrices Bi with ||-Bj|| \\B~ \\ < a 
and any positive semidefinite X we have 

(8) \\X\\<C\\B 1 \\-i---\\B k \\-i\\S(X,B l )\\i. 

Proof. Let C — Cs, a be the constant in Lemma [8Jl By Lemma Rm if S(X, Bi) = 0, 
then X = 0, and the bound is trivial. Otherwise, set Bi = P = S(X, Bi), 

P = \\P\\-]P, and X = \\Bif. ■ ■ ■ \\B k \\i\\P\\-ix. Since \\B^\\ = WBiUB^W < a 
and S(X, Bi) = P, we may apply Lemma \8A\ to get that 



□ 



(9) ||*|| < C. 

Substituting X = 1 1 B\ \ \ ? • ■ • 1 1 B k \ \ > 1 1 P \ \ _ * X into ((9|) and rearranging produces 
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9. Calculation of Jacobi Matrices 



From here on, we will assume all matrices are real. We shall identify M n = M„(IR) 
with R d , where d — n 2 , by means of the vec operator. If A — [a^] € M„ then vccA 
is the column vector obtained by stacking the columns of A below one another: 

vec A = [an • ■ ■ a n i OL12 a nn ] T . 

Recall that the Kronecker product of two n x n matrices A and B is the matrix 

auB ■ ■ ■ a\ n B 



A®B 



a n \B 



a nn B 



The following lemma can be found in 25] page 30]. We reproduce the proof 
below for the reader's convenience. 

Lemma 9.1. IfA,B,X <E M n , then 

vec (AXB) = (B T ® A)vec X. 

Proof. For a given matrix Q, let Qk denote the kth column of Q. Let B = [6^]. 
Then 



(AXB) k = AXB k 

= A (jSjfcXi) = [ b lk A 



b nk A ] vecX. 



Therefore, 



vec(AXB) = 



b n A 

hnA 



bnnA 



vecX = (B T (g> A)vecX. 



□ 



Suppose that Y(X) € M„ is a function of the matrix variable X e M n . Following 
P5] , we define the derivative of y with respect to X to be the Jacobi matrix of 
vecy with respect to vecX. That is, if [yi,. .. ,yd] T = vecy and [xi,... ,Xd] T = 
vecX, then 



dY 
dX 



dxj 



Notice that it follows from Lemma 19.11 that 



(10) 



d(AXB) 
dX 



= B A 



A. 



Using (|10p . we derive a matrix calculus version of the product rule (see 
more on matrix calculus). 



for 



Proposition 9.2. Let Y(X) 6 Y\ n and Z(X) G IM n be functions of the matrix 
variable X £ IMn- Then 



(11) 



d(YZ) , T s dY dZ 
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Motivated by Theorem 17. 3( we want to calculate the derivative of a word 
W = W(X, Bi, . . . , Bk). To state the result, we need to introduce some notation. 
Let W have degree s > 1. Enumerate the occurrences of X in W(X,Bi) from left 
to right, and for each j e {1, . . . , s} let W^{X, Bi) be the portion of W(X, Bi) that 
appears to the left of the jth occurrence of X. For instance, if 

W(X, B U B 2 ) = B\XB\B 2 XB 2 B x X 2 B 2 X, 

then Wl(X,B 1 ,B 2 ) = BlXB\B 2 XB 2 B x X . We adopt the convention that W£ = 
I if X is the first letter of the word. In a similar way we define Wj i (X, Bi) to be the 
portion of W(X, Bi) that appears to the right of the jth appearance of X. Notice 
that 

W(X, Bi) = Wf(X, Bi)XWf(X, Bi) 

for any j = 1, . . . , s. 

Proposition 9.3. Let W = W(X, Bi) be a word of degree s, and Bi G M„. Then 

(12) ^ = £ Wf(X, Bif ® W?(X, Bi). 

i=i 

Proof. We proceed by induction on the length of W. For the words X and BX 
(B = Bi, . . . , Bk), IT2"]) is a special case of (fTOj) . Now suppose that (IT21) holds for a 
fixed word W = W(X, Bi) of degree s. Pick B E {B 1 , . . . , B k } and set 

W(X,Bi)=W(X,Bi)B. 

Then dTUD and ^TQ imply that 

dW ,„ T T .dW 

= (B 1 ®I) 

dX K J dX 



{B T ® I) ^ (X, B,) T ® (X B< 



(Jf, B 4 )S) T <g> VF/(X, Si 



S 

so formula (fT2")l holds for W. 

Next set W(X, S 4 ) = W{X 1 B i )X. Appealing again to (JT0J) and (JTTJ) , we com- 
pute: 



= (x T ®i)J2 w ? ( x > B *) T ® ^/ ( x ' B + ( J ® ^i+i) 

s+l 

= 51 w f B ^ T ® w f( x > 



3=1 

and so fT2")) holds for W . This completes the induction and the proof. □ 
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We next write down expression (|12|) for some explicit interlaced symmetric words, 
beginning with the most basic one. 

Example 9.4. For a positive integer s, the Jacobi matrix ofvecX s with respect to 
vecX is given by 

as) ^=£(^') r 

In particular, since the Kronecker product of two positive (semi) definite matrices 
is also positive (semi) definite (see jTSJ P- 245]j, j s positive (semi) definite 

whenever X is positive (semi) definite. □ 

Example 9.5. Consider the symmetric word S in two letters given by 

S(X,B) = XBX 2 B 3 X 2 BX. 

If B is positive definite and X is symmetric, then 

— = XBX 2 B 3 X 2 B®I + XBX 2 B 3 X®XB + XBX 2 B 3 ®XBX 

+XBX ® XBX 2 B 3 +XB® XBX 2 B 3 X + 1® XBX 2 B 3 X 2 B. □ 

10. The Brouwer Degree of Symmetric Word Equations 

Our proof of Theorem 11.51 consists of two main steps. In the first, we calculate 
the degree of the simple map ip s (X) — X s and show that it is 1. And in the 
second, we create a homotopy from the function f(X) = S(X,Bi) to f s (X) and 
apply Theorem 17.41 Before initiating our proof, we need to identify the set of real 
positive definite matrices with an open set in Euclidean space. To this end, we 
identify the set Sym„ of real symmetric matrices with R m , in which m = ^n(n+ 1), 
by identifying a real symmetric matrix A = [a»j] with the point 

H{A) = (an,... , a n \ , 0,22 , • • • j On2 , ■ ■ ■ ) CL nn ) . 

More precisely, if A £ IM„ then we define n(A) — (yi, . . . , y m ), where 

yi(2n-j)(j-i)+i = 1 < 3 < i < n. 

The restriction /i|s y m„ is a linear isomorphism from Sym n onto R m . We denote by 
v the inverse of /i|sym • Let 

O = {[i{X) | X is positive definite}. 

The set of positive definite matrices is therefore identified with the open subset 
O C R m , and the set of positive semidefinite matrices is identified with the set O. 
Define a function p s : R m -> R m by 

tp s = (J, o (p s o v. 

Since ip s maps Sym n into itself, it follows that Zp s {n{X)) = n(X s ) for every sym- 
metric matrix X. We intend to show that det J^ s (fi(X)) > when X is positive 
definite. First, however, we need a lemma describing a relationship between eigen- 
values of Jacobi matrices for functions / : R d — > R d and their restrictions / to 
certain subspaces. In what follows, the set of eigenvalues of a matrix H is denoted 
by a(H). 
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Lemma 10.1. Let f : R d -> R d be a C 1 map and V C R d be a linear subspace 
of R d such that f(V) C V. Let tt : R m — > V be a linear isomorphism, and let 
f : R m — > R m be given by f = ir^ 1 o f o n. Then for every x£l/, we have 

In particular, if x £ V and J/(x) is positive definite, then Jr(7r (x)) has positive 
eigenvalues. 

Proof. Let {e±, . . . , e^} be the standard basis for R d . By choosing a linear change 
of variables u : R d — > R d such that u(V) = spanjei, . . . , e m } and considering the 
C 1 map g — u o / o we may reduce to the case that V = spanjei, . . . , e m }. We 
may likewise assume that tt(zi, . . . , z m ) — (z\, . . . , Z m , 0, . . . , 0). 
Write / = (/i , . . . , fd) and let x e V. If j < m < k, we have 

f k (-K + te 3 ) = for alH G R 

since /(V) C y. Therefore, 

(14) |^(x) = for all j < m < k. 

OXj 

It follows that J/(x) has the block form 



Jo * 
* 



in which J is the m x m leading principle submatrix of J/(x). In particular, this 
implies that cr(Jo) C ct(J/(x)). It is straightforward to verify that 

f (xi , . . . , X m ) — i^f\{x\, . . . , x m , o,...,o),...,/ m ( 0,...,0)), 
from which it follows that Jj(7r _1 (x)) = Jo. This proves the lemma. □ 

Lemma 10.2. At any positive definite matrix X, the Jacobi matrix J{p g (p(X)) of 
the map tp s has positive eigenvalues. In particular, det J^ s (n(X)) > 0. 

Proof. Let V = {vecX \ X E Sym„} be the linear subspace of R d identified with 
Sym Tl . The function <fi s : R d — * R d defined by 

4> s (vecX) = vecX s 

maps V into itself. Let tt = vec o v, and notice that 7r : R m — > V is a linear 
isomorphism and that tp s — ir^ 1 o (ft s o tt. By Example 19.41 if X is a positive 
definite matrix, then J^ s (vecX) — dX s /dX is also positive definite. Applying 
Lemma flO-H we conclude that the Jacobi matrix J^ s (7r _1 (vecX)) = J^ g {fi(X)) 
has positive eigenvalues at any positive definite X . □ 



Our next result verifies Theorem [TT5] for the special case S(X,Bi) — X s . 

Proposition 10.3. Let s be a positive integer, P a positive definite matrix, and 
V C O a bounded open set containing ^(P 1 / 5 ). Let g be the function <p 8 restricted 
to O. Then 

deg( 5j V,M(P)) = l. 
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Proof. Lemma \l . 2 1 implies that (i(P) is a regular value for g. Using Theorem 17.31 
and Lemma ri0.2[ we calculate: 

deg( 5 , V, m(P)) = s § n dct W = s § ndet W-P 1/s )) = 1- 



□ 



The following straightforward fact will be used in our proof of Theorem 11.51 



Lemma 10.4. Let V be the set of positive definite matrices of norm less than 
K > 0. Then the boundary ofV is given by 

dV = {X e V : det(X) = 0} U {X e V : \\X\\ = K}. 

We are now ready to calculate the Brouwer degree of a general symmetric word 
equation; the boundedness part of Theorem 11.51 follows from Lemma 18. II 



Proof of Theorem \1.5\ From the discussion in Section [6] we may assume that our 
interlaced symmetric word S(X, B\, . . . , Bf.) is of the form ((4|. Fix positive definite 
matrices B\, . . . , Bk, a positive definite matrix P and set f(X) = S(X, Bi). Also 
set / = /i o f o v. We will show that there is a bounded, open subset V C O such 
that 

(15) deg(/ ! V,M(P)) = l. 

Let t G [0, 1]. By Proposition l8.2l there exists a constant K independent of t such 
that any positive semidefinite solution X of the equation S(X, tBi + (1 — t)I) = P 
has \\X\\ < K. Indeed, if 

a= max \\tB t + (1 - t)I\\ ■ \\(tBi + (1 - t)/)" 1 1| < oo 

KKfc, 0<t<l 



and 



(3 = min \\tB l + (1 - t)I\\ > 0, 

KKfc,0<«l 



then we must have 

\\X\\ <C s , a P~^ \\Pt> <oo. 

Let V = Vk be the open set of positive definite matrices with norm less than 
K. For each t € [0, 1], let ft be the function from the positive semidefinite matrices 
into itself given by ft(X) = S(X,tBi + (1 — t)P). From our choice of K, it follows 
that ft(X) ^ P when X is positive definite with ||X|| = K. Moreover, if X is 
singular, then taking a determinant shows that ft(X) ^ P. Thus P £ f t (dV) for 
each t E [0, 1] by Lemma [TOl 

Set V = fj,(V) and f t = fj, o f t o v. Then we have fi(P) f t {dV) for t e [0, 1]. 
Since (x, t) >— > /t(x) is continuous, Theorem 17.41 implies that 

deg(/ ,V,/i(P)) =deg(/ 1) V,MP)). 
Since /q = and fi — f, (fT5|) now follows from Proposition ll0.31 □ 
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11. NONUNIQUENESS OF SYMMETRIC WORD EQUATIONS 

The following corollary of Theorem 11.51 is a crucial ingredient in our proof of 
Theorem 

Corollary 11.1. Fix an interlaced symmetric word S and let f — fs be as in the 
proof of Theorem \1.5l Suppose there is a positive definite matrix Xq such that 

det JjQi(X )) < 0. 

Then the symmetric word equation 

S(X,B i ) = S(X ,B i ) 

has at least two real solutions X. 

Proof. Let Xq be as in the statement of the corollary, and set P — S(Xo,Bi). If 
n(P) is a regular value of /, then Theorems 11.51 and 17.31 imply that there must be 
at least two solutions X\ and X2 of S(X, B>i) = P such that 

det Jr(/i(JQ)) > 0, i = l,2. 

If fJ-(P) is not a regular value of /, then there exists a positive definite matrix X\ 
such that S(X 1 ,B i ) = P and 

Jj(KXi)) = 0. 

Since Jj(n{Xi)) ^ Jj{p{X Q )), it follows that X ^ X x . □ 

Let S and / be as in Corollary 111.11 We outline a method for obtaining the 
smaller Jacobian matrix Jj(^t(X)) from the larger Jacobian matrix dS/dX. To 
simplify the bookkeeping of indices, define 

a(i,j) = n(j - 1) +i, i,j = l,...,n 

and 

(3(k,l) = -(2n- l)(l - 1) + k, l<l<k<n. 

Thus if X — [xij] G M„, then the a(i, j)th entry of vecX is equal to x^, i,j — 
1, . . . , n. Likewise, the [3(k, l)th entry of fi(X) is Xki, 1 < I < k < n. 
The Jacobi matrix Jj of the map 

/ = ^o(X^S{X i B i ))ov 

— (/1 o vec -1 ) o (vec o (X S(X, Bi j) o vec -1 ) o (vec o v) 

is given by 

JjOPO) = M(dS/dX)N, 

in which M £ IM m x<i is the matrix representation of /1 o vec -1 and N 6 IMdxm is 
the matrix representation of vec o v. It is easy to see that if 1 < i,j < n and 
1 < / < k < n, the (a(i,j),j3(k,l)) entry of N is 

1 if i = k,j = l or i — l,j = k 
otherwise 

and the ((3(k, I), a(i,j)) entry of M is 

1 if i = k,j = I 
otherwise. 
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Example 11.2. When n — 2, the matrices M and N as described above are: 

1 
1 



M 



N 



We are now ready to prove the main result of this section. 

Proof of Theorem \l-4\ Let A\ and B\ be as in Section[4l and let S be the symmetric 
word 

S(X,B) = XBX 2 B 3 X 2 BX. 

Let f(X) = S(X,Ai) and / = fj, o f o v. Using Mapkfl, we calculate 

det Jj{B x ) = -633705909477329213831177437148144640 < 0. 

By Corollarv lll.il it follows that the symmetric word equation 

S(X,A 1 ) = S(B U A 1 ) 

has at least two distinct real positive definite solutions X. □ 

Remark 11.3. Despite many numerical attempts using generalized Newton meth- 
ods, we were unable to produce any of these other solutions. It appears that the 
ill-conditioning of A\ and B\ necessary to achieve a negative determinant causes 
difficulty for numerical equation solvers. 

We note that there are many other words which can be shown to have multiple 
solutions using the techniques found in the proof of Theorem 11.41 We list a few of 
them below: 



XBX k BX, 9 < k < 20 
XBXB 2 X k B 2 XBX, 2 < k < 16 
XBX k B 3 X k BX, 2 < k < 15 
XB 2 XBX k BXB 2 X 1 6 < k < 40. 



In general, we do not know how to characterize those equations which give rise to 
unique solutions. 

The techniques we have developed here can also be used to show unique solv- 
ability of some particular symmetric word equations. As an illustration, we close 
this section by proving that the word equation S(X, B) — XBX 2 B 3 X 2 BX = P 
is uniquely solvable in 2 x 2 real positive definite matrices. From Example 19.5) we 
have 

A Q 

= XBX 2 B i X 2 B ig) / + XBX 2 B 3 X ®XB + XBX 2 B 3 ® XBX 

(16) dX 

+ XBX (g> XBX 2 B 3 +XB(g> XBX 2 B 3 X + 1 <g> XBX 2 B 3 X 2 B, 

in which I is the 2x2 identity matrix and X and B are 2x2 positive definite 
matrices. 



^Code that performs this calculation and the one found in Theorem 111.41 is available at 
http:/ /math. berkeley.edu/~sarm or http:/ /www. math. tamu.edu/~chillar 
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Theorem 11.51 and the remarks in this section reduce unique solvability to a verifi- 
cation of whether det[M (dS '/ dX)N] > for all 2 x 2 real positive definite matrices 
X and B. This observation is used to prove the following theorem. 

Theorem 11.4. The word equation S(X, B) = XBX 2 B 3 X 2 BX = P has a unique 
real solution for all 2 x 2 real positive definite matrices B and P. 

Proof. It is clear that in the theorem statement we may assume B is diagonal 
(perform a uniform unitary similarity) and that X has determinant 1 (homogeneity 
of (HHJ)). We may, therefore, parameterize X and B as 



X = 



X 


y 


, B = 


a 


" 


. y 


i+y 2 

X 





b _ 



With the assistance of Maple, the expression det[M(dS/dX)N] factors into a prod- 
uct of terms that are easily seen to be positive: 

i^l(3aV + 3&V° + 15&V + 30&V + Wb 5 y 6 + 15&V + ia 4 bx 8 + 

x° 

22a 2 b 3 x 4 y 4 + 12a 2 b 3 x 4 y 6 + 15a 2 b 3 x 2 y 2 + 21a 2 b 3 x 2 y 4 + 21a 2 b 3 x 2 y 6 + 6a 2 b 3 x 2 y 8 + 
5ab 4 x 6 y 2 + 3b 5 + 6a 5 x 8 y 2 + 3a 5 x 6 y 2 + 3a 5 x 6 y 4 + 3a 2 b 3 x 2 + 3ab 4 x 2 + 6b 5 x 4 y 4 + 
lSb 5 x 2 y 6 + 3b 5 y 2 x 4 + 6b 5 y 2 x 2 + 18b 5 y 4 x 2 + 3b 5 y 6 x 4 + 6b 5 y 8 x 2 + 3a 3 b 2 x 8 + 
8ab 4 x 4 y 2 + 6a 3 x 8 y 2 b 2 + 10a 3 x 6 y 2 b 2 + 12a 3 x 6 y 4 b 2 + la 3 x 4 y 2 b 2 + VSa 3 x 4 y 4 b 2 + 
6a 3 x 4 y 6 b 2 + 3ax 6 y 4 b 4 + Uax 4 y 4 b 4 + 6ax 4 y 6 b 4 + \2ax 2 y 2 b 4 + 18ax 2 y 4 b 4 + 
\2ax 2 y% 4 + 3ax 2 y 8 b 4 + 3a 4 bx 8 y 2 + 8a 4 bx 6 y 2 + 6a 4 bx 6 y 4 + 5a 4 bx 4 y 2 + 8a 4 bx 4 y 4 + 
3a 4 bx 4 y 6 + 7a 2 b 3 x 6 y 2 + 6a 2 b 3 x 6 y 4 + I0a 2 b 3 x 4 y 2 ). 

This completes the proof. □ 

We close with a conjecture that vastly generalizes this last result. 

Conjecture 11.5. Symmetric word equations in 2 x 2 positive definite matrices 
have unique solutions. 

Remark 11.6. Standard transformations for 2 x 2 matrices reduce the general 
problem to the real case. Therefore, for instance, Theorem 111.41 is fully general in 
the sense of this conjecture. 
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